Accelerating Deep Neuroevolution on Distributed FPGAs for Reinforcement Learning Problems

نویسندگان

چکیده

Reinforcement learning, augmented by the representational power of deep neural networks, has shown promising results on high-dimensional problems, such as game playing and robotic control. However, sequential nature these problems poses a fundamental challenge for computational efficiency. Recently, alternative approaches evolutionary strategies neuroevolution demonstrated competitive with faster training time distributed CPU cores. Here we report record times (running at about 1 million frames per second) Atari 2600 games using implemented FPGAs. Combined hardware implementation console, image preprocessing network in an optimized pipeline, multiplied system level parallelism enabled acceleration. These are first application demonstration IBM Neural Computer, which is custom designed that consists 432 Xilinx FPGAs interconnected 3D mesh topology. In addition to high performance, experiments also showed improvement accuracy all compared same algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generative NeuroEvolution for Deep Learning

An important goal for the machine learning (ML) community is to create approaches that can learn solutions with human-level capability. One domain where humans have held a significant advantage is visual processing. A significant approach to addressing this gap has been machine learning approaches that are inspired from the natural systems, such as artificial neural networks (ANNs), evolutionar...

متن کامل

Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

Deep artificial neural networks (DNNs) are typically trained via gradient-based learning algorithms, namely backpropagation. Evolution strategies (ES) can rival backprop-based algorithms such as Q-learning and policy gradients on challenging deep reinforcement learning (RL) problems. However, ES can be considered a gradient-based algorithm because it performs stochastic gradient descent via an ...

متن کامل

Deep Learning on FPGAs

DEEP LEARNING ON FPGAS Griffin James Lacey Advisor: University of Guelph, 2016 Dr. Graham W. Taylor Co-Advisor: Dr. Shawki Areibi The recent successes of deep learning are largely attributed to the advancement of hardware acceleration technologies, which can accommodate the incredible growth of data sizes and model complexity. The current solution involves using clusters of graphics processing ...

متن کامل

Neuroevolution strategies for episodic reinforcement learning

Article history: Received 30 April 2009 Available online xxxx

متن کامل

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Journal on Emerging Technologies in Computing Systems

سال: 2021

ISSN: ['1550-4832', '1550-4840']

DOI: https://doi.org/10.1145/3425500